APPENDIX 
Descriptions of coalescent generators 



We indicate in this section the general formulae for different types of coalescent models 

describing the rates of coalescence of k lineages when there are n active ancestral lineages in 

total. For a sample size n, each &-tuple of lineages is merging to form a single lineage at rate 

X )% k , and no other transitions are possible. 
i 

\ k =\x k ~ 2 {\-x) n ~ k A{dx) with2<£<rc [Al] 
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Where A is a probability measure on [0,1] which defines the size, i.e. how many lineages 
merge, and probability, that is how often merging of lineages occur, of the coalescent events. 

1 ) Kingman coalescent 

The Kingman coalescent is obtained from eq. Al with: 
A(dx) = 5 0 dx [A2] 

which is the Dirac point mass function at 0. In other words, the Kingman coalescent is defined 
with the rate of coalescence being X, %k = 0 except if k=2 where X n> z=\. Only binary collisions, 
i.e. coalescence of two lineages, are thus possible. 

2) A-coalescent 

The A-coalescent is generally defined in eq. Al where A specifies the size and probability of 
the coalescent events following given specific Dirac mass functions with peculiar 
interpretations. The function 
A(dx) = c8 y dx (c > 0) [A3] 

defines a model where a fraction y of the population is replaced by the offsprings of a single 
ancestor at rate c. This macroscopic reproduction event is called an "extreme reproductive 
behaviour", and leads to multiple collisions in the genealogy (coalescence of more than 2 
lineages). 

Another special extreme case occurs when choosing A = §\dx where the Dirac mass function 
is on one, which generates a star like-genealogy where all n lineages merge to a single 
ancestor. 



3) Beta-coalescent 

This class of coalescent is a special case of A-coalescents for which A has a density define as 
a function Beta(a,2- a) (with 1 < a < 2). Note that the special case where a = 2 corresponds to 
the classic Kingman coalescent above. 

A(dx) = 1 x l ~ a (1 - x) a ~ l dx [A4] 

T{2-a)T{a) 

As for the A-coalescent, the "extreme reproductive behaviour" in eq. A4 leads to multiple 
collisions in the genealogy with coalescence of more than 2 lineages. 

4) Bolthausen-Snitzman 

The Bolthausen-Snitzman coalescent is obtained as a specific case of the A-coalescent where 
A(dx) = Beta(l,l), or in other words, a beta-coalescent with a = 1. In this case, the Dirac 
function has a uniform distribution of mass on [0,1]. 



5) m -coalescent 

Eldon and Wakeley (2006) have developed a model where a single parent chosen at random 
from the population of size N contributes either 1) one offspring with probability 1-s as in the 
classic Wright-Fisher model, or 2) \\iN offsprings with probability s. The latest case generates 
a large reproduction event, and a multiple merging of several lineages. The case where s = 
1/N 7 is considered so that if 0 < y < 2. The function A from eq. Al is: 

A(dx) = (S 0 + cy/ 2 5 v )dx 

This function defines a model where a fraction y/ of the population is replaced by the 
offsprings of a single ancestor at rate c. 

Eldon and Wakeley rewrote eq. Al to define the rate of multiple merger of k lineages among 
n under their model: 



y/ k (\-y/) n - k , with0<y < 1. 



